Discriminative bag-of-cells for imaging-genomics.

نویسندگان

  • Benjamin Chidester
  • Minh N Do
  • Jian Ma
چکیده

Connecting genotypes to image phenotypes is crucial for a comprehensive understanding of cancer. To learn such connections, new machine learning approaches must be developed for the better integration of imaging and genomic data. Here we propose a novel approach called Discriminative Bag-of-Cells (DBC) for predicting genomic markers using imaging features, which addresses the challenge of summarizing histopathological images by representing cells with learned discriminative types, or codewords. We also developed a reliable and efficient patch-based nuclear segmentation scheme using convolutional neural networks from which nuclear and cellular features are extracted. Applying DBC on TCGA breast cancer samples to predict basal subtype status yielded a class-balanced accuracy of 70% on a separate test partition of 213 patients. As data sets of imaging and genomic data become increasingly available, we believe DBC will be a useful approach for screening histopathological images for genomic markers. Source code of nuclear segmentation and DBC are available at: https://github.com/bchidest/DBC.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Supervised Topic Models for Video Activity Recognition

Topic models successfully capture latent structure useful for unsupervised analysis of bag-of-words data. Applying these models to domains such as video activity recognition requires two critical extensions: (1) incorporating supervised information (activity labels) to recover topic structure with greater discriminative power and (2) moving beyond the bag-of-words assumption to model temporal d...

متن کامل

Multi-instance Learning with Discriminative Bag Mapping

Multi-instance learning (MIL) is a useful tool for tackling labeling ambiguity in learning because it allows a bag of instances to share one label. Bag mapping transforms a bag into a single instance in a new space via instance selection and has drawn significant attention recently. To date, most existing work is based on the original space, using all instances for bag mapping, and the selected...

متن کامل

INAOE's Participation at PAN'15: Author Profiling task

In this paper, we describe the participation of the Language Technologies Lab of INAOE at PAN 2015. According to the Author Profiling (AP) literature. In this paper we take such discriminative and descriptive information into a new higher level exploiting a combination of discriminative and descriptive representations. For this we use dimensionality reduction techniques on the top of typical di...

متن کامل

ژنومیکس انگل ها

Genes carry instructions to make protein that affect body's cells and their physical activity. They also play an important role in the occurrence of various characteristics in the body. Recently, scientists in the new field of science known as genomics have studied the genetic instructions. Genomics deals with the discovery of all the sequences in the entire genome of organisms and is used to s...

متن کامل

Onm-21: General Principles of Collecting and Storing Cord Blood Stem Cell

Cord blood is the blood that remains in the umbilical cord and placenta following birth, which is usually discarded It contains red blood cells, white blood cells, platelets, and plasma, like blood. In addition, cord blood is a rich source of stem cells that may have potentially lifesaving benefits for your baby and family. The cord blood of baby serves as an abundant source of stem cells. Thes...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

دوره 23  شماره 

صفحات  -

تاریخ انتشار 2018